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Abstract 

It has been recently established that joint design of content delivery and storage (coded caching) can signihcantly improve 
performance over conventional caching. This has also been extended to the case when content has non-uniform popularity through 
several models. In this paper we focus on a multi-level popularity model, where content is divided into levels based on popularity. 

We consider two extreme cases of user distribution across caches for the multi-level popularity model: a single user per cache 
(single-user setup) versus a large number of users per cache (multi-user setup). When the capacity approximation is universal 
(independent of number of popularity levels as well as number of users, files and caches), we demonstrate a dichotomy in the 
order-optimal strategies for these two extreme cases. In the multi-user case, sharing memory among the levels is order-optimal, 
whereas for the single-user case clustering popularity levels and allocating ah the memory to them is the order-optimal scheme. 

In proving these results, we develop new information-theoretic lower bounds for the problem. 

1. Introduction 

Wireless traffic has been dominated by broadband content access (driven by video applications) and has strained current 
wireless network capacity. While there have been tremendous improvements in wireless data rates over successive generations 
of wireless systems, these gains alone are not projected to keep up with exponential rise in wireless data demand. Pre-fetching 
and storing content in edge caches is one strategy that helps reduce network traffic HI. Recently, it has been shown that joint 
design of storage and delivery (a.k.a. “coded caching”) can significantly improve content delivery rate requirements ||2l. This 
was enabled by content placement that creates (network-coded) multicast opportunities among users with access to different 
storage units, even when they have different (and a priori unknown) requests. Coded caching has been shown to be well-suited 
to next-generation (heterogeneous) wireless network architectures a, 0. 

The setup studied in Q, JS) consisted of single-level content, i.e., every file in the system is uniformly demanded. However, 
it is well understood that content demand is non-uniform in practice, with some files being more popular than others. Motivated 
by this, 0, Q, a, a considered such non-uniform content demand, following different models. 

In 0, a, 0, the setup considered a single user per cache requesting a file independently and randomly according to some 
arbitrary probability distribution that represents content popularity. These works studied the trade-off between the average rate 
and the cache memory. A memory-sharing scheme was proposed in 0, and its achievable rate was characterized. However, 
from our understanding, this scheme was not shown to be order-optimal in generalj^ In ||7l, 0, a different scheme was 
proposed, based on a clustering of the most popular levels. It was shown to be order-optimal for Zipf-distributed content in 
0, and, more recently, for arbitrary distributions in 0. 

By contrast, in 0, a deterministic multi-level popularity model was introduced, where it is assumed that a large number 
of users connect to each cache. Content is divided into discrete levels based on popularity, and, for each level, a fixed and a 
priori known fraction of the users per cache request files from said level. It is easy to see that, when the number of users per 
cache is large enough, this deterministic model will closely approximate an equivalent stochastic-demands model similar to 
0, 0, 0. A worst-case rate-memory trade-off was studied, where the “worst case” is over all user demand tuples that obey 
the constraints set by the model, i.e., where there is a fixed number of users per popularity level per cache. A memory-sharing 
strategy was shown to be order-optimal]^ and a parametric characterization of how much memory to allocate to each popularity 
level was established. It was shown that sometimes it is better to store some less popular content without completely storing 
the more popular content. 

In this paper, we focus our attention on the multi-level popularity model, and we explore the role of the number of users 
per cache by contrasting two extreme cases: a single user per cache with an overall fixed profile of content request across 
all caches (called the “single-user” setup in this paper) versus a large number of users per cache with a fixed content request 
profile for every cache, as in 0 (the “multi-user” setup). We ask for an order-optimality guarantee that is independent of 
the number of popularity levels (in addition to the number of users, files, and caches); a stronger requirement than in 0. 
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^We refer to an “order-optimal” result as one that is within a constant multiplicative factor from the information-theoretic optimum. 

^The approximation was, however, dependent on the number of popularity levels; an aspect that is strengthened in this paper. 


For such an approximation guarantee, we demonstrate that drastically different strategies must be used for each setup. In 
the single-user case, we show that clustering the most popular levels and giving them all the memory, leaving none for the 
rest, is order-optimal; a strategy similar to those proposed in 0, 0. In contrast, the multi-user case requires a complete 
separation of the different levels and a division of the memory between them: the memory-sharing scheme introduced in 0. 
In order to prove these results, we needed to develop new information-theoretic lower bounds for both problems; in particular, 
the multi-user setup necessitated new non-cut-set bounds that use sliding-window entropy inequalities ID, providing much 
stronger approximation guarantees than the results in 0. 

We also discuss the dichotomy between the two setups. We show that the memory-sharing strategy can perform poorly in 
the single-user setup, while clustering can be arbitrarily worse than memory-sharing in the multi-user case. Furthermore, we 
briefly explore an open problem that is part of our ongoing research: that of solving a mixture of the two extreme setups. 

The paper is organized as follows. Section [I^ formulates the problem, describing precisely the multi-user and single-user 
setups. We establish some background in Section |I^ which enables us to state the main results in Section IV Section [V| gives 
an interpretation of the results and the dichotomy in the two setups and briefly explores the mixture of the setups. Detailed 
proofs are given in the appendices. 


II. Setup 

Consider a system where a group of users request files from a server, according to some popularity model. All files are 
assumed to be of size F bits. Prior to any user requests, a placement phase occurs in which information about these files is 
placed in the K access point (AP) caches, of capacity MF bits each. Then, in the delivery phase, users connect to the different 
caches, and each requests a file based on the popularity model; the more popular files are more likely to be requested. The 
server then sends, through the base station (BS), a broadcast message of size RF bits that all the users can hear. The users 
combine the broadcast with the contents of their cache to recover the file that they have requested. Clearly, there is a trade-off 
between the values of M (the “cache memory”) and R (the “broadcast rate”). The larger the caches, the more information 
they can store, and hence the smaller the broadcast needed to serve the requests. 

The popularity model that we consider is the multi-level model. The files are divided into L popularity levels, such that 
all files in a single level are equally popular. The levels consist of Ni,..., Nl files. When a large enough number of users 
are present in the system, we expect a crystallization of the user profile with respect to the popularity levels. Specifically, we 
expect to know, to some degree, the number of users that are making requests from each level. For example, suppose there 
are two popularity levels such that a randomly chosen user is three times as likely to request a file from the first level as he 
is from the second. Suppose there are 40 users in the network. Then, we would expect that about 30 of them will request a 
file from the first level, and 10 from the second. The multi-level popularity model assumes that these numbers are fixed and 
known for every level. The large number of users allows this deterministic model to closely resemble the stochastic-demands 
model. This enables us to analyze the rate-memory trade-off in the worst case, among all cases where user demands respect 
the constraints imposed by this model. 

A pair {R, M) is said to be achievable if there exists a placement-and-delivery strategy that uses caches of memory M and 
transmits, for any possible combination of user requests, a broadcast message of rate at most R that satisfies all said requests. 
Our goal is to And all such achievable pairs. In particular, we wish to And the optimal rate-memory trade-off: 

R*{M) = inf {R : {R, M) is achievable} , 


where the minimization is done over all possible strategies. 

In this paper, we study how the number of users in the system affects its overall behavior. In terms of the setup, the difference 
between a large and small number of users is a difference in the distribution of the popularity levels across the users. For 
illustration, consider again the example above with two levels and a user ratio of 3 to 1, and suppose that there are 20 caches 
in the network. If every cache had just one user connecting to it, then we would expect that about 15 of all the 20 users will 
request files from the first level, and 5 would request files from the second level, giving a situation similar to Fig. However, 
we cannot predict which users will be at which cache. On the other hand, suppose there were 20 users at every cache (for 
a total of 400 users). Then, we would predict that, at every cache, approximately 15 users will request files from the first 
popularity level and 5 will request files from the second. Hence, the request profiles would be roughly similar for all caches, as 
illustrated in Fig. We stress again that the multi-level model is deterministic, and that these predictions are in fact assumed 
as fixed and known realizations. 


A. Multi-user setup 

Consider the setup shown in Fig. [T] For every level i, we have exactly Ui users connecting to every cache and requesting a 
file from i. Notice that every level is represented at every cache. This setup is identical to the one studied in 0. 

We assume the following two regularity conditions. First, for every popularity level i, there are more files than users: 


Vi, N, > KUi. 


(1) 
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Fig. 1. Multi-user setup with K = A caches, and L = 2 levels with {Ui, U 2 ) = (2,1) users per cache. 
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Fig. 2. Single-user setup with K = 4 caches, and L = 2 levels with (Ki, K 2 ) = (3,1) users. 


This can be seen, for example, in video applications such as Netflix, where “files” would be video segments of a few seconds 
to a few minutes. 

Second, we assume that no two levels have similar popularities. The popularity of a level can be written as the number of 
users per file of the level. Hence, if z is a more popular level than j, the regularity condition states: 


UJN, ^ 1 
U,/N, - / 32 ’ 


( 2 ) 


where /3 = 1/80. The reasoning behind this condition is that, if it did not hold for some levels i and j, then we can think of 


them as essentially one level with Ni 
to both Ui/Ni and Uj/Nj. 


Nj files and Ui + Uj users per cache. The resulting popularity 


Ui + Ui 


would be close 


B. Single-user setup 

Consider now the setup in Fig. depicting the other extreme. We have only one user connecting to every cache, for a total 
of K users. The only information known a priori is that, for each level i, exactly Ki out of the K users will request a file 
from i. However, we do not know which users these will be. The achievability scheme that we will design must allow for any 
possible arrangement of the users. 

In this setup, we also assume that we always have more files than users. In particular, for any level i: 


N, > K,. 


(3) 


III. Preliminaries 

Coded caching was introduced in 0, which established its order-optimality for a single level of popularity and a single 
user per cache. The extension to multiple users per cache was looked at in 0, 0, ifTOl and is a special case of ifTTIl : it can 
be formally stated as follows. 


Lemma 1 (Adapted from M12I Lemma 3]). For a single-level caching system with K caches, U users at every cache, N 
uniformly-distributed files, and a cache memory of M, the following rate is achievable: 



K,N,U) = U ■ min 








































Furthermore, this rate is within a constant of the optimum. 


In lO, we developed a memory-sharing scheme for content with multi-level popularity model. The strategy consisted of 
dividing the cache memory between all the L levels, and then treating each level as a separate caching sub-system, with 
the reduced memory. In other words, we give level i a memory UiM, where at S [0,1] and at = 1, and then apply a 

single-level placement-and-delivery strategy for this level on this aiM memory, separately from the other levels. The total rate 
for this scheme is; 

L 

K, {iV„ [/JO = K, N,, [/,). (4) 

An optimization of the memory-sharing parameters {ai}i was studied in a, and an allocation which was demonstrated to 
be order-optimal was established. At a high-level this was done by partitioning the popularity levels into three sets: iT; /; and 
J. The levels in H have such a small popularity that they will get no cache memory. On the opposite end of the spectrum, 
the levels in J will get enough cache memory to completely store all their files in every cache. Finally, the rest of the levels, 
in the set I, will share the remaining memory among themselves, obtaining some non-zero amount but not enough to store all 
of their files. An order-optimal {H,I, J) partition and corresponding memory assignments were established in O. 


Theorem 1 (Adapted from ||3 equation (2)]). Given a multi-user caching setup, with K caches, L levels, and, for each level 
i, Ni files and Ui users per cache, and a cache memory of M, the following rat^ is achievable: 




h^H 






iGl 


where is a particular type of partition of the set of levels called an M-feasible partition^ 


Intuitively, since a level h G H receives no cache memory, all requests from its KUh users must be handled directly from 
the broadcast, which requires the transmission of KUh complete files. The users in set J require no transmission as the files 
are completely stored in all the caches; however, set J does affect the rate through the memory available for levels in I. This 
is apparent in the expression M — Finally, the levels in I, having received some memory, result in a rate that is 

inversely proportional to the effective memory and that depends on the level-specific parameters Ni and Ui. 


IV. Main results 

A. Multi-user setup 

The memory-sharing scheme was proved to be within an approximation factor of 0(T^) of the information-theoretic optimum, 
where L is the number of levels a. In the following theorem, we tighten this gap by developing new, non-cut-set lower bounds, 
which use sliding-window entropy inequalities introduced in ||9) and take into account the contribution of all the levels to the 
rate. With these bounds, we are able to completely remove the dependence on the number of levels L. 


Theorem 2. For all valid values of the problem parameters K, L, {Ni, Ui}i, and M, we have: 


R{M) 

R*{M) 


< 192, 


where R{M) is the rate achieved by memory-sharing, and R*{M) is the optimal rate over all strategies. 


B. Single-user setup 

In the single-user setup, the scheme that we propose is quite different. Instead of separating the levels, we cluster a subset 
of them into a super-level that will be treated as essentially one level. Specifically, we partition the levels into two subsets: H' 
and The set I' will be clustered as one super-level and given the entire memory M, while H' will receive no memory. 

To understand how to choose H' and consider the following rough analysis. Suppose that all levels except one (let us 
call it j) have been split into H' and I'. Then, ignoring level j for the moment, the rate, using Lemma would be; 


M ■ 


Y^Kh 


h^H' 


^This expression of the rate is a slight approximation that we use here for simplicity as it is more intuitive. 
"^See (3 for more details. 






If we were to add level j to H', that would result in the addition of a Kj term, since all Kj requests would be completely 
served by the broadcast. On the other hand, if it is added to then we would get an additional Nj/M term, since the total 
number of files in I' would increase by Nj. Clearly, it is beneficial to choose the smaller of the two quantities. 

Though the above analysis is rough, its main idea still holds. In general, we choose the partition I') as follows: 

I'= {Hy. (5) 

Then, by giving all of the memory to we can apply a single-level caching-and-delivery scheme to obtain the rate in the 
following theorem. 


Theorem 3. Consider the multi-level, single-user setup with L levels, Ni files and Ki users for each level i, and cache memory 
M. Then, the following rate is achievable: 

(M, {iV„ Ky) = Y,Kh +max -1,0 

hGH' ^ 


where H' and V are as in •Si. 


The next theorem shows that this scheme is order-optimal for the single-user setup. 


Theorem 4. In the single-user setup, if R{M) denotes the rate achieved by the clustering scheme and R*{M) denotes the 
optimal rate, then, for all values of the problem parameters L, {Ni,Ki}i and M: 


R{M) 

R*{M) 


< 72. 


This result can be proved using cut-set bounds. 


V. Comparison 

In this section, we first compare the memory-sharing and the clustering strategies, and we explore the dichotomy among the 
two setups that is emphasized by the difference between strategies. We will then discuss why such a dichotomy exists, and 
explain the need for different lower bounds for each setup. Finally, we explore a new problem that combines both setups by 
including both multi-user and single-user levels. 


A. Comparing the two caching-and-delivery strategies 

We have previously argued that memory-sharing is the best scheme to use in the multi-user case, while clustering is the 
near-optimal strategy in the single-user case. However, why could one (or both) of these schemes not be good enough for both 
situations? We will show, in this section, how that is not the case: memory-sharing can give an L-dependent gap between its 
rate and the optimum when used in the single-level setup; meanwhile, the rate achieved by clustering in the multi-user case 
can be arbitrarily far from the optimal rate. We give examples of these two cases. 

Consider a multi-user setup with two levels such that {Ni,N 2 ) = (2®’', 2®'’) and {Ui,U 2 ) = (2^’', 2’’) for some r > 0. 
Suppose that there is enough memory so that both levels are to be partially stored in the caches. With the memory-sharing 
scheme, that would give a rate of approximately: 

(yNim+yrnf i .,9.. 

M M ^ ^ ' 

On the other hand, if we had clustered the two levels into one, then this super-level would have (A^i -l-?V 2 ) files and (?7i -l-t/ 2 ) 
users per cache, resulting in the following rate: 

(Wi+iV2)(C/l+(72) 1 

^- M -= ^ 

Clearly, the latter rate is about 2®’' times as large as the former, a ratio that can get arbitrarily large as r increases. 

This difference is most pronounced when the popularities of the two levels become significantly different (in the above 
example, the popularity ratio was = 2®’'). Intuitively, if the two levels had similar popularities, then memory-sharing 

gives them similar amounts of memory, effectively merging them. However, if their popularities were very different, then they 
should be given highly unequal portions of the memory. 

Consider now the single-user case with L levels, and suppose again that the memory is such that all levels will be partially 
stored. Let us assume that = • • • = Ni^. Using the clustering scheme, we get the following approximate rate: 


?Vi + • • • -f iVi LWi 











However, with memory-sharing, we would get; 

{^/m+■ ■ ■ + L^iVi 

M ~ M ' 

which is larger by a factor of L. Essentially, we are sending L broadcasts, one per level, when we could send just one broadcast 
for all L levels. 

B. Analysis of the dichotomy between the setups 

The dichotomy between the two extremes is striking. They require drastically different strategies, and the strategy that is 
good for one setup is not so for the other. This suggests a fundamental difference between the two setups. 

To understand this difference, consider what happens when sending a coded broadcast message. Each message targets a 
specific subset of users. If, in this subset, there exist two users that are connected to the same cache, then these users have 
access to the exact same side information. As a result, no coding can be done across these two users, and there is hence no 
use in including them in the same broadcast. 

With that in mind, consider again Fig. [T] and Fig. Notice how, in the multi-user setup, there are multiple rows of users, 
each of which consists of users from the same popularity level. Each such row is a complete set of users with no common 
caches; any additional users would have access to the same cache as some other user. Thus, it is sufficient to consider them 
in a broadcast transmission that is separate from all other rows. Since, as a result, no two levels will share the same broadcast 
message, it can only be beneficial to choose the best possible division of the memory, based on popularities. 

In the single-user setup, however, there is only one row of users that contains all the users from all the levels. It is hence 
possible to generate coding opportunities across levels. Merging is thus a better option in this situation, and merging is most 
efficient when all levels receive equal memory per file. 

C. The difference in the lower bounds 

The reason different types of lower bounds are needed for the two setups is similar to the reason for the dichotomy in their 
respective caching-and-delivery strategies. In the single-level setup studied in ||2l, cut-set bounds were given to lower-bound the 
optimal rate. Depending on the value of the rate, a certain number of caches were considered and used in the cut-set bounds. 

When transitioning to the multi-level, multi-user scenario, we get a concatenation of broadcast messages, resulting in a sum 
of single-level rates. Since these rates have potentially different values, each requires a cut-set bound that considers a different 
number of caches. Thus we need lower bounds that consist of sums of cut-set bounds, each considering a different number of 
caches; a single cut-set bound is not enough. 

However, in the single-user case, we are again faced with a single broadcast message to all users. Thus, one cut-set bound 
is enough to give a lower bound on this rate. 

D. Mixing the setups 

So far, we have looked at the two extremes; either all levels were represented at all the caches, or none of them were. A 
natural problem arises; that of studying intermediate cases. The simplest form such intermediate cases can take is one where 
levels of both types are present. 

Specifically, let there be two classes of popularity levels; F and Q. The class F consists of levels i that are represented by 
exactly Ui users at every cache. In contrast, there is exactly one row of users that represents all the levels in the class Q: each 
level i S 0 is represented by Ki of those users. 

The most natural strategy to employ in this situation would be to superpose the multi-user and the single-user strategies. In 
particular, we divide the memory M into yM and (1 — y)M, for some 7 € [0,1]. We give the first part to F and the second 
part to Q, and apply their respective strategies on their part of the memory. We believe this to be the best strategy, but proving 
its order-optimality requires developing new lower bounds that consider levels of both classes at the same time; this is part of 
our on-going work. 
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Appendix A 

Proofs for the multi-user case 

A. Elaboration on the achievability (Theorem 

We here elaborate on the achievability of the multi-user setup, as the details will be important when proving Theorem in 
Appendix A-C The analysis is slightly different from the one in lO. 

As discussed in Section III the strategy involves finding a good partition (i/, J, J) of the set of levels. Below we give such 
a partition. 


Definition 1 (M-feasible partition). For any cache memory M, an M-feasible partition {H,I,J) of the set of levels is a 
partition that satisfies: 

When, M < {l/K)^Nh/Uh; 

Wiel, il/K)y^NjU, < M < (1 + l/K) sjNi/Up, 
yj G J, (1 + l/K) fNj/U, < M, 

where M = (M — Tj-\- Vj)/Si, and, for any subset A of the levels: 

= Ta=Y.Np, VA = Y,NyK. 

i^A i^A iGA 


Such a partition always exists. Furthermore, the set I is never empty as long as M < long as the caches do 

not have enough memory to store everything. 

After choosing an M-feasible partition, we share the memory among the levels as follows: 


yh G H, ahM = 0; 

yi G /, a,M = fN/U, ■ M - Ni/K; 

yj G J, UjM = Nj. 

It can be easily verified that G [0,1] for all i, and = 1- Furthermore, we can also show that aiM < Ni for all i. 

To properly analyze the achievable rate, we need to look more closely at the set I. In the single-level scenario in ||2l, Q, 
three regimes were identified, and they were analyzed separately. These were: when M < N/K, when M > cN for some 
constant c G (0,1), and the intermediate case. We identify three similar regimes for each level in i. Formally, let (3 = 1/80, 
and define: 

/o = {i G / : M < {2/K)sjNyu3j ; 

/i = {i G / : M > (/3 + l/K)s/N~JU^ ; 

/' = /\(/oU/i), (6) 

In other words, /q is the set of levels i such that a^M < Ni/K, Ii is such that aiM > fiNi for all i G Ii, and /' is the rest. 
For convenience, we call the resulting partition {H, Iq, Ii,J) a refined M-feasible partition. Note that, because of regularity 
condition (|^, we have that Ii will, at all times, contain at most one level. 

We will now give upper bounds on the rates achieved individually for each level. 












Lemma 2. Given a refined M-feasible partition {H, Iq, I', Ii, J), the individual rates of the levels are upper-bounded by: 


yh e H, 
yi e /o u 

v^ G /i, 


Vj G J, 

For lack of space, we skip the proof of Lemma 


RhiM) = KUh] 

2SiVlWi 

Ri{M) < V * * 


M-Tj + Vi' 


(i - 


1 TT ^lo + ' 5 '/' 




Rj{M) = 0 . 


/3 yro 


B. Information-theoretic lower bounds 

The lower bounds we use are a sum of cut-set bounds. Each such cut-set bound matches a popularity level; it considers a 
certain number of caches that depends on the individual rate of the level, and then decodes files from just that level, resulting 
in an expression that matches the individual rate of the level. To obtain this sum of different cut-set bounds, the sliding-window 
subset entropy inequality m Theorem 3] is used. 


Lemma 3 (Sliding-window subset entropy inequality 12 Theorem 3]). Given K random variables (Yi,. 
ever s G {1,..., iT — 1}.‘ 


1 

s 


K 




i>)> 


1 

s -f 1 


K 

i=l 


.., Yk), we have, for 


where we define (i) = i if i < K and {i} = i — K if i > K. 


The resulting lower bounds are given in the following lemma. 


Lemma 4. Consider the multi-level, multi-user caching setup. Let b G N"*" and f G {1,..., iT}, and let Si £ {1,, [iT/2<J} 
for every level i. Then, for every memory M, the optimal rate can be bounded from below by: 


L 

R*{M) > min 

i=l 






Proof: Without loss of generality, assume Si < • • • < s^. For every i and m, define ZJ" = ..., where 

Zj denotes the j-th cache. For every i, we consider the t caches Z* as well as b broadcast messages Xj’ = {X^ i,... 

We have; 

1 ^ 

By defining Yi = {Zl, A’/') and rearranging the terms, we can use Lemma to get; 

1 1 if 

bR + tM>---Y,H (z‘, z‘,+t> > • ■ • > > 

i—1 


Thus, we have move from entropy terms with t caches and b broadcast messages to entropy terms with sR caches and sib 
broadcasts. For simplicity, we write these entropy terms as H{sit, sib). By applying a similar process for i increasing from 
1 to L, we can get entropy terms of the form H{sit, sfi). We use each such term to decode a certain number pi of files from 
level i Since there are sfi caches, Ui level-i users per cache, and sfi broadcasts, the total number of files that can be recovered 








is: Pi = min{Sit • Ui ■ Sib,Ni}. Let W* be this set of files. Then: 


bR + tM > 




(“) 1 
> — 

- K 

(fc) 1 
> — 

- K 

(a) 1 
> — 

- K 

(fc) 1 
> 

- K 

> ... 


— H{sit,sib) 

51 ^ 

t —1 

1 ^ 

— (^tT(sit, Si&|w^) +Pi) 
i —1 

. K 

— H{s2t, S 26 |>V^) + — 

1 ^ 

— V iT(s 2 t,S 26 |>V\>V^) + — 

5 2 ^ Si 



P2 

32 


where inequalities marked with (a) use the 


Lemma and those marked with (6) use Fano’s inequality. 


C. Proof of approximate optimality (Theorem^ 

For lack of space, we are unable to give the entire proof of Theorem It consists of several cases that are all analyzed in 
a similar manner, although with different values of certain parameters. We will therefore give one such case in the hopes that 
it will be representative of the remaining cases. This case is one where: K > 96, Ii = 0, and ,J 

Consider the lower bounds in Lemma We will choose the parameters t, b, and Si such that the terms in the sum match 
the upper bounds in Lemma 


Vh G H, 
V*o S Iq, 

Vt G/', 
Vj G J, 


t = h 
Sh = 


Si = 

Sj = 1 ; 


iT/8j; 

(1/16) 

( 1 / 8 ) 


Sr^/NJlfo 

M-Tj +Vi 

Si^MM 


b = 


64 


M-Tj + Vi 
(M-T j + Vif 

s! 


The hrst thing to do is to verify that these parameters satisfy their constraints. The variables t and {sj}j^j trivially do. 
For h G H, we have = [1^/8] > [96/8] > 1 and s^t < K, thus satishes all the constraints. 

For {sijig/s we have, by 

il/8)Si,/NjUi ^ (l/8)y]V/^ ^ ^ 

M-Tj + Vi - {l/K + /3)./N;/Ui- ’ 

and hence s, > 1. Furthermore: 

< w m V W - < imym , ^ 

M-Tj + V, {2IK)yN~Jlh 

Therefore, Si satishes the constraints for i G I'. We can likewise show that, for ig G Iq, we have Si^ > 1 and si^t < iT/2. 






















Finally, regarding the parameter b, we use the fact that there exists some level j G J to say: 

{M-Tj + Vif ^ ^ ' 


64- 




> 64 =MN,IU,>1, 


and hence h>l. 

We must now evaluate the expression in Lemmaj^using these parameters. For convenience, define = mm{sitUi, Ni/sib} 
for each level i. We will lower-bound the value of Ai for every i, which requires evaluating the following comparison: 


bs‘^,t § N,/U,. 

Consider h G H. Using Definition we can evaluate 0: 

(m-Tj + VjY 


(7) 


bsf^t < 64- 
< 64 


Sj 

Uh 


{K/^f ■ 1 


K^/M 


< 


Uk' 


which implies: 

For iQ G Iq, a similar process gives: 

and, for i G I', 

Finally, for j G J, we have: 


Ah > ShtUh = [K/S\ Uh > {K/8 - l)Uh 
> KUh ■ (1/8 - 1/96) = {n/96)KUh. 


> s^otU,„ > (1/24) 
A^ > sAU, > (49/480) 


M-Tj +Vi' 

SiVWJl 
M-Tj + Vi ' 


bsjt = b > 32 


(M-Tj + Vif ^ Nj 


Sj 


- U,' 


again using Definition As a result, we have Aj = Nj/b. 

By combining all the Ai values together, we get the following lower bound on the optimal rate (recall the values of Si and 
Tj from Definition [^: 


R*iM) > J2{n/96)KUh+ (1/24) -^^^ 


h^H 




^(49/480)^'^ 




> (11/96) ^ KUh 


M-Tj + Vi 

^ Nj M 

M -Tj +Vi V 

KJ 

(1/24)S'| M-Tj 


heH 


M-Tj + Vi 


>{ll/9Q)Y,KUh + (im — 

hGH 

M -Tj 

~ 32{M -Tj + ViY/Sj 
>{n/9Q)Y,KUh + {im — 


Sj 


h^H 


M-Tj + Vi 


Sj 


M-Tj + Vi 


1 


Sj 


32 M-Tj + Vi 


= (11/96)^ if t7;4 +(1/96) — 


5 


heH 


M-Tj +Vi 


( 8 ) 





















(9) 


From Lemma 1^ we can see that the achievable rate is bounded by: 

R{M) < Y, KUh 


2S] 


hGH 


M-Tj +Vi 


Combining ([^ with (|^, we get that, in the case we are considering: 

R{M) 

_t_ L < 100 

R*{M) - 

The rest of the proof consists in carrying out a similar procedure for all other cases. In the end, the worst-case gap between 
the achievable rate and the optimal rate is the maximum over the gaps found for each case. 


Appendix B 

Proofs for the single-user case 

A. Proof of the achievability (Theorem^ 

Recall how the memory is divided among the sets H' and defined in 0 : all of the available memory is given to 
which is treated as one super-level. As a result, all requests for files from H' must be handled by a complete file transmission 
from the BS. Since there are users making such requests, the result is the same amount of transmissions. 

For the set now considered as one super-level, we use the single-level strategy from la. Although only a subset of the 
caches is active in our setup, the same strategy still applies. Indeed, the placement in 0 is a random sampling of the files 
in all the caches; we do the same placement in this case. In the delivery phase, we now know the caches to which the users 
connected. We perform a delivery as in 0, assuming that only these caches were every present in the system. 

As a result, the rate required for I' can be directly derived from Lemma 0 . using caches, ^l^s, and 1 

user per cache. In addition, we have, from 0, that M > Ni/Ki for all i S I'. This implies M > (Yiei’ ^'i.)/iYiei' 

and hence the rate for /'is: „ 

n Ya^I' , 

It will be helpful for the later analysis to refine the partition as follows. 

Definition 2. Define the following partition (G, H, /, J) of the set of levels: 

G = {g: M < Ng/Kg and Kg <5 and M < Ng/6} ; 

H = {h: M < Nh/Kh and Kh > 6} ; 

I = {i: NJK, <M< NJ6} ; 

J = {j:M> Ng/6}. 


Furthermore, we rewrite and bound the rate as follows: 


R{M) <YKg+YKh + 

geG heH 




M 


M 


- 1 


( 10 ) 


where [a:]'*' = max{a;,0}. We define Nj = Yj^jR^j’ ™cl upper-bound the last term by: 



Nj/M 

6{1-M/Nj) 

0 


if M < Nj/6-, 
if Nj/6 <M<Nj; 
if M > Nj. 


( 11 ) 


B. Proof of approximate optimality (Theorem^ 

As previously mentioned, we use a cut-set bound to lower-bound the optimal rate. The idea is to send a certain number b of 
broadcast messages Xi,..., Xb that serve certain requests. We choose these requests as follows. For every level i G GLUfUl, 
consider a certain number Si < Ki of caches. These caches are distinct across levels. For all the b broadcasts, the users 
connected to these Si caches will altogether request sfi distinct files from level i if there are that many; otherwise they request 
all Ni files. For the levels in the set J, we collectively consider some sj caches (distinct from the rest). The users at these sj 
caches will use all b broadcasts to decode as many files from the set J as possible, up to sjb files. Let nj denote this number. 











If we let S = s^ + sj be the total number of caches considered, then, by Fano’s inequality: 


bR + SM>H{Zi,...,Zs,Xi,...,Xb) 


> ^min{sj 6 , + nj 
i^J 



i^J 


We will analyze each of the Vi and vj terms separately. We identify two cases for which the analysis is slightly different. 
1) Case M < 1/6.' When M is this small, we choose 6=1 broadcast message. Notice that, because of regularity condition 
Q, we have Ni/Ki > 1 > 1/6 > M. 

The achievable rate in this case can be upper-bounded by: 


R{M) <J2k,. 


Consider now any level i. Let Si — Ki. Then, 


/ . I T I M 

= Ki ^min |l, ^ - M 

> i5/6)K,. 


We can combine (fT^ with ( [T3| l and (fT^ to get: 

R(M) ^ 6 

R*{M) - 5' 

2) Case M > 1/6.' We will now choose b = \6M~\ > 1. 

a) Bound for g G G: Consider Sg = 1. Then, 

I, «. 1 « 

1 M 

- 2 ~ f3M 
_ 1 

“ 3’ 

because Sgb = \6M~\ < 2 ■ 6 M < 2Ng. 

b) Bound for h € H: Consider Sh = [iT/i/ 6 ] > 1. Then, 

Nn \ M\ 1 

6 V I 4,KhM) 6M) 72 

because Shb = [iF/i/ 6 ] • [ 6 M] < AKhM < ANh- 

c) Bound for i G I: Consider Si = [A// 6 M]. Then, 


. iV, / .1 iV, 

— - 1 


6M 




AK 



1 ^ 
72 ’ M’ 


( 12 ) 


(13) 

(14) 


(15) 


(16) 


(17) 


because Sib = [lVi/ 6 M] • \6M~\ < ANi. 






d) Bound for J: First, if M > Nj, then the set J contributes nothing to the upper bound on the rate in ( fTOl i; see O- 
Thus we can ignore it, i.e., say vj > 0. 

So the interesting case is M < Nj. Here, we must decode files from multiple levels collectively. Consider sj = \Nj/QM^, 
where Nj = Notice that there are enough users and broadcasts to decode all files, because: 

Nj 

s jb > -• 6M = Nj. 

However, we must take care that no broadcast considers more than Kj users at a time for any j S J. This can be ensured: 
since there are b = \6M~\ broadcasts, and b > Nj for all j G J, then every broadcast need only consider at most one user per 
level. Hence, all of the Nj files can be decoded, and nj = Nj. 

If M < Nj/6, we have: 


vj = 


> 


Nj - SjM 


1 


Nj- 


> 


12M 
144 Nj 
TT ' Af ■ 

If Nj/6 < M < Nj, then sj is actually equal to 1, and: 

Nj-M 

h 

Nj-M 


Nj 

12M 


vj = 


> 


12M 




12 


Nj 


M 


(18) 


(19) 


3) Multiplicative gap: By combining ( [TS] ), ( [Thl l, ( |l7] i, ( |T8] l and with ( [Tol l ( [TT] i, and also taking into account ([T^, 
we get: 


R{M) 

R*{M) 


< 72, 


which concludes the proof of Theorem [^ 








